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ABSTRACT 


This  paper  is  concerned  with  joint  test  of  non-nested  models  and  simultaneous  departures  from 
homoskedasticity,  serial  independence  and  normality  of  the  disturbance  terms.  Locally  equivalent 
alternative  models  are  used  to  construct  joint  tests  since  they  provide  a  convenient  way  to  incorporate 
more  than  one  type  of  departures  from  the  classical  conditions,  the  joint  tests  represent  a  simple 
asymptotic  solution  to  the  "pre-testing"  problem  in  the  context  of  non-nested  linear  regression  models. 
Our  simulation  results  indicate  that  the  proposed  tests  have  good  finite  sample  properties. 

Key  Words  and  Phrases:  locally  equivalent  alternative  models;  non-normal  errors;  non-spherical  errors; 
pre-testing  problem;  simulation  study 


1.  Introduction 

In  recent  years  a  substantial  literature  has  been  developed  for  testing 
non-nested  regression  models.  While  the  available  procedures  are  now  fre- 
quently used  for  both  testing  and  modelling  purposes,  in  many  cases  it  would 
seem  that  the  non-nested  models  are  presumed  to  have  disturbances  satisfy- 
ing the  classical  conditions  of  serial  independence  (I),  homoskedasticity  (H) 
and  normality  (N).  In  practice,  while  departures  from  the  classical  conditions 
occur  quite  frequently,  it  is  not  straightforward  to  modify  the  available  test 
procedures  to  incorporate  all  the  departures,  especially  non-normality  (N) 
of  the  disturbances.  Moreover,  in  the  nested  testing  situation,  most  of  the 
popular  tests  are  "one-directional"  in  that  they  are  designed  to  test  against 
only  a  single  alternative  hypothesis,  and  in  most  cases  the  tests  are  valid 
only  when  the  other  standard  assumptions  are  satisfied.  Many  researchers 
have  found  that  the  one-directional  tests  are  not  robust  in  the  presence  of 
other  misspecifications  [see  Bera  and  Jarque  (1982)  and  the  references  cited 
therein].  In  Section  2,  the  robustness  of  several  well  known  tests  for  both 
nested  and  non-nested  hypotheses  is  discussed  briefly.  In  Section  3,  we  de- 
velop a  procedure  for  testing  non-nested  models  together  with  simultaneously 
checking  the  sphericality  and  normality  of  the  disturbance  terms.  Locally 
equivalent  alternative  models  are  used  to  construct  joint  tests  since  they  pro- 
vide a  convenient  method  for  incorporating  more  than  one  type  of  departures 
from  the  classical  conditions.  The  joint  tests  represent  a  simple  asymptotic 
solution  to  the  "pre-testing"  problem  in  the  context  of  non-nested  linear  re- 
gression models.  In  Section  4,  we  investigate  the  finite  sample  properties 
of  our  proposed  tests  through  a  limited  simulation  study.  Some  concluding 
remarks  are  given  in  Section  5. 

2.  Robustness  of  Several  Existing  Tests 

In  testing  nested  hypotheses,  three  kinds  of  situations  can  occur,  namely 
undertesting,  overtesting  and  complete  misspecification.    When  departures 


from  the  null  hypothesis  are  multi-directional  and  an  one-directional  test  is 
used,  undertesting  is  said  to  occur.  On  the  other  hand,  in  overtesting,  the 
test  statistic  overstates  the  alternative  hypothesis.  Complete  misspecifica- 
tion  occurs  when  the  assumed  alternatives  and  the  data  generating  process 
(DGP)  are  mutually  exclusive.  In  both  undertesting  and  overtesting,  a  loss 
of  power  is  to  be  expected.  However,  joint  testing  of  several  hypotheses  is 
rarely  performed  in  practice,  so  that  inferences  are  generally  affected  by  un- 
dertesting. By  drawing  upon  some  Monte  Carlo  results  from  Bera  (1982) 
and  Bera  and  Jarque  (1982),  we  highlight  the  effects  of  undertesting  and 
the  subsequent  non-robustness  of  a  commonly  used  test  of  heteroskedastic- 
ity.  For  an  analytic  study  on  the  non-robustness  of  specification  tests  under 
different  kinds  of  misspecifications,  see  Bera  and  Yoon  (1990). 

Three  convenient  simplifying  assumptions  are  usually  made  in  standard 
regression  analysis,  namely  H,  I  and  N.  In  what  follows,  we  consider  the 
Breusch  and  Pagan  (1979)  Lagrange  multiplier  (LM)  test  for  heteroskedas- 
ticity  (H).  The  data  are  generated  under  different  combinations  of  N  (the 
t  distribution  with  five  degrees  of  freedom),  H  (additive  heteroskedasticity) 
and  serial  dependence  I  (first-order  serial  correlation);  for  further  details, 
see  Bera  (1982)  and  Bera  and  Jarque  (1982).  The  estimated  powers  of  the 
test  statistic  under  different  DGPs  are  given  in  Table  1.  The  results  are  for 
sample  size  50  and  significance  level  .10  and  are  based  on  500  replications. 
Therefore,  the  maximum  standard  errors  of  the  reported  numbers  would  be 
^.5(1  -  .5)/500  ~  .022. 

TABLE  1.  Robustness  of  the  LM  test  for  heteroskedasticity 

Nature  of  Alternative  Model 
Correct  Contaminated  Misspecified 


DGP  HIN  HIN  HIN  HIN  HIN  HIN  HIN 

Estimated  power         .726  .660  .270  .302  .084  .222  .128 


When  the  DGP  contains  only  heteroskedasticity  (that  is,  HIN),  the  LM  test  is 
asymptotically  optimal  against  the  correct  alternative  and  power  is  estimated 
to  be  .726.  However,  the  estimated  powers  fall  to  .660  and  .270  when  the  data 
are  contaminated  by  the  t  distribution  (HIN)  and  first-order  serial  correlation 
(HIN),  respectively.  The  effect  of  I  on  the  LM  test  of  H  is  quite  substantial. 
When  both  I  and  N  contaminate  the  DGP,  the  estimated  power  is  only  .302. 

The  final  three  cases  are  completely  misspecified  in  that  the  LM  test  is 
seeking  to  detect  H  when  H  does  not  exist  but,  I,  N  or  both  I  and  N  are 
present.  The  interpretation  of  the  estimated  powers  depends  on  how  the 
tests  are  to  be  viewed.  If  the  test  is  seen  exclusively  as  a  test  of  H,  then  the 
estimated  powers  should  equal  the  estimated  significance  levels  or  sizes.  It  is 
clear  from  Table  1  that  the  estimated  sizes  are  significantly  greater  than  the 
nominal  size  of  the  test  in  all  three  cases.  However,  if  the  test  is  used  purely 
as  a  significance  test,  the  estimated  powers  seem  to  be  quite  low,  and  are 
much  lower  than  those  for  the  case  where  the  alternative  model  is  contami- 
nated. Regardless  of  the  interpretation,  therefore,  the  test  does  not  perform 
well.  The  one-directional  test  is  not  robust  when  the  alternative  model  is 
contaminated  and  the  test  is  not  satisfactory  in  the  completely  misspecified 
case,  regardless  of  how  it  is  interpreted.  Here  we  have  used  Breusch  and 
Pagan  (1979)  test  for  H  for  illustration  only.  As  suggested  by  Payen  (1980) 
and  Koenker  (1981),  this  test  can  be  made  robust  by  studentizing  it.  Bera 
and  Jarque  (1981)  report  some  Monte  Carlo  results  on  the  performance  of 
this  robustified  test. 

Turning  now  to  the  case  of  non-nested  hypotheses,  extensive  Monte 
Carlo  experiments  have  been  conducted  by  Godfrey  and  Pesaran  (1983)  and 
Godfrey  et  al.  (1988).  The  first  of  these  two  papers  is  concerned  with  the 
selection  of  regressors  in  two  non-nested  linear  regression  models,  and  ex- 
amines the  Cox  test  of  Pesaran  (1974),  two  mean-  and  variance- adjusted 
versions  of  the  Cox  test,  the  J  test  of  Davidson  and  MacKinnon  (1981),  the 
JA  test  of  Fisher  and  McAleer  (1981),  and  the  standard  F  test  applied  to  the 


comprehensive  model  constructed  as  a  union  of  the  two  models.  Since  the 
tests  are  valid  only  asymptotically  when  the  disturbances  are  not  normally 
distributed,  Godfrey  and  Pesaran  (1983)  examine  the  robustness  of  the  tests 
to  errors  drawn  from  the  log-normal  distribution  and  the  chi-squared  dis- 
tribution with  two  degrees  of  freedom.  Their  experiments  indicate  that  the 
finite  sample  significance  levels  are  not  significantly  distorted  and  are  broadly 
similar  to  those  for  the  case  of  normally  distributed  errors.  Although  esti- 
mated powers  tend  to  be  greater  when  the  errors  are  drawn  from  the  two 
non-normal  distributions  compared  with  the  normal  case,  the  relative  rank- 
ings of  the  tests  in  terms  of  power  are  not  affected  by  the  non-normality. 

Various  procedures  are  considered  by  Godfrey  et  al.  (1988)  for  testing 
the  non-nested  linear  and  logarithmic  functional  forms.  The  test  procedures 
are  classified  as  non-nested  tests,  two  versions  of  the  LM  test  and  a  vari- 
able addition  test  based  on  the  more  general  Box- Cox  transformation,  and 
diagnostic  tests  of  (possible)  functional  form  misspecification  against  an  un- 
specified alternative  hypothesis.  If  the  logarithmic  model  is  to  be  taken 
seriously,  the  dependent  variable  of  the  linear  model  cannot  take  on  negative 
values  and  the  disturbances  of  the  linear  model  cannot  be  normal.  There- 
fore, it  is  essential  to  examine  the  robustness  of  the  tests  to  non-normality 
of  the  errors,  even  if  the  primary  consideration  rests  with  testing  the  non- 
nested functional  forms.  Godfrey  et  al.  (1988)  examine  the  finite  sample 
significance  levels  and  powers  of  the  tests  when  the  disturbances  follow  the 
gamma  (2,1)  distribution,  the  log- normal  distribution  and  the  t  distribution 
with  five  degrees  of  freedom.  The  two  versions  of  the  LM  test  based  on  the 
Box-Cox  model  are  found  to  be  highly  sensitive  to  non-normality  in  that  the 
estimated  significance  levels  are  far  greater  than  those  predicted  by  asymp- 
totic theory,  even  when  the  sample  size  is  eighty.  On  the  other  hand,  the 
variable  addition  tests  in  the  three  categories  are  found  to  be  robust  to  non- 
normality  of  the  errors,  and  their  relative  rankings  in  terms  of  power  are  not 
affected  by  departures  from  normality. 


3.  Joint  Tests 

The  standard  situation  for  testing  non-nested  linear  regression  models 
with  normal  and  spherical  errors  is  as  follows.  It  is  desired  to  test  the  null 
model  Ho  against  the  non-nested  alternative  H\,  where  the  two  models  are 
given  as 

H0  :y  =  X(3  +  Uo,         u0  -jV(0,<rJ/n) 

and 

Hi  :y  =  Z1  +  ul,  Ul  ~  jV(0,<7?/„), 

in  which  y  is  the  nxl  vector  of  observations  on  the  dependent  variable,  X  and 
Z  are  nxl;  and  n  x  g  matrices  of  observations  on  k  and  g  linearly  independent 
regressors,  /3  and  7  are  k  x  1  and  g  x  I  vectors  of  unknown  parameters,  and 
wo  and  u\  are  vectors  of  normally,  independently  and  identically  distributed 
disturbances.  It  is  also  assumed  that  X  and  Z  are  not  orthogonal,  and  that 
the  limits  of  n~1X'X,  n~1Z'Z  and  n~1X'Z  exist,  with  the  first  two  positive 
definite  and  the  third  non-zero.  If  X  and  Z  contain  stochastic  rather  than 
fixed  elements,  the  probability  limits  of  the  appropriate  matrices  must  exist, 
and  X  and  Z  must  be  distributed  independently  of  uq  and  u\  under  Hq  and 
H\,  respectively. 

In  considering  the  consequences  of  testing  for  certain  departures  from 
sphericality,  it  will  be  convenient  to  rewrite  the  two  models  as 

Ho  :  yt  =  x\f3  +  uot  (1) 

and 

Hi  :yt  =  2J7  +  uu,  (2) 

in  which  x't  and  z\  are  the  t'th  rows  of  X  and  Z,  respectively,  and  t  = 
1,2,  ...,n.  When  the  assumptions  regarding  uot  and  uu  are  not  satisfied, 
some  of  the  properties  of  the  tests  will  be  affected.    For  example,  Pesaran 


(1974)  derived  a  test  of  non-nested  linear  regression  models  where  the  dis- 
turbances of  each  model  follow  a  first-order  autoregressive  scheme.  How- 
ever, Pesaran's  test  is  very  complicated  to  apply  in  practice  and  a  simple 
procedure  is  given  in  McAleer,  Pesaran  and  Bera  (1990).  The  effect  of  het- 
eroskedasticity  will  be  similar.  In  practice,  a  straightforward  application 
of  the  tests  suggested  in  Davidson  and  MacKinnon  (1981)  and  Fisher  and 
McAleer  (1981)  will  not  be  valid  since  the  standard  errors  will  not  be  correct. 
However,  use  of  a  heteroskedasticity-consistent  covariance  matrix  estimator 
will  circumvent  this  problem.  When  the  errors  are  not  normal,  Pesaran's  test 
based  on  the  work  of  Cox  (1961,  1962)  is  still  valid  asymptotically,  although 
its  small  sample  properties  will  be  affected.  Normality  is  required  for  the 
test  suggested  by  Fisher  and  McAleer  (1981)  to  have  the  exact  ^-distribution 
under  the  null  hypothesis;  if  normality  does  not  hold,  the  test  will  be  valid 
only  asymptotically. 

In  the  light  of  the  above  discussion,  a  basic  requirement  for  applying 
standard  non-nested  testing  procedures  to  achieve  high  power  is  that  the 
models  under  consideration  be  well- specified.  This  means  that  tests  for  nor- 
mality and  sphericality,  for  example,  are  to  be  performed  prior  to  testing  the 
non-nested  models  themselves.  An  important,  and  frequently  overlooked,  as- 
pect of  testing  non-nested  models  in  this  two-step  procedure  is  the  effect  that 
such  "pre-testing"  may  have  on  the  levels  of  significance  and  powers  of  the 
non-nested  tests.  Therefore,  it  may  be  desirable  to  test  the  non-nested  speci- 
fications jointly  with  departures  from  the  classical  assumptions  regarding  the 
disturbance  terms.  Such  procedures  will  be  particularly  useful  when  there 
is  a  possibility  of  a  non-normal  disturbance  term  of  unknown  type,  since 
it  is  frequently  difficult  to  take  account  of  general  forms  of  non-normality 
in  a  straightforward  manner.  Joint  tests  may  be  constructed  in  a  straight- 
forward way  by  developing  an  approximate  model  which  incorporates  the 
various  departures  from  the  classical  conditions  into  the  systematic  part  of 
the  model,  so  that  the  disturbances  of  the  approximate  model  are  normally, 


independently  and  identically  distributed.  The  advantage  of  joint  tests  over 
the  two-step  testing  procedure  lies  in  the  way  the  joint  testing  procedure 
deals  with  the  "pre-testing"  problem,  at  least  asymptotically.  In  the  follow- 
ing section  we  will  study  the  finite  sample  properties  of  a  joint  test  through 
Monte  Carlo  experiments. 

In  the  context  of  deriving  LM  diagnostic  tests,  Godfrey  and  Wickens 
(1982)  suggested  a  way  of  obtaining  a  local  approximation  to  a  given  model 
with  a  non-standard  disturbance  structure.  Such  approximations  are  called 
"locally  equivalent  alternative" (LEA)  models.  As  an  illustration,  consider 
Ho  in  (1),  where  it  is  now  assumed  that  the  disturbance  uot  is  given  by 

uqi  =  PoUot-i  +  cot,      eot  ~AfID(Q,<rl),      \p0\  <  1  (3) 

for  t  =  2, 3, . . . ,  n.  A  LEA  model  to  Ho  may  be  written  as 

#o  :  Vt  =  x'tP  +  PoUot-i  +e0<,  (4) 

in  which  UQt  =  yt  —  x't(3  and  /3  is  the  ordinary  least  squares  estimate  of  j3 
under  H0.  The  models  Hq  in  (4)  and  H0  in  (1)  and  (3)  are  "equivalent"  in 
the  sense  that: 

(i)  when  po  =  0,  Ho  and  Hq  are  identical; 

(ii)  when  p0  =  0,  then  dlt(0,5o,po)/dpo  =  dl*{^al,p0)/dp0, 
where  lt(j3,a2,p0)  and  /*(/?, <72,p0)  are  the  log-density  func- 
tions for  the  t'th  observation  under  H0  and  Hq  ,  respectively. 

Godfrey  (1981)  has  shown  that,  in  testing  p0  =  0  for  local  alternatives,  the 
likelihood  ratio  test  of  p0  =  0  applied  to  H0  in  (1)  and  (3)  and  the  LM  test 
of  po  =  0  applied  to  Hq  in  (4)  have  similar  power.  Thus,  for  values  of  p0  in 
the  neighbourhood  of  zero,  H0  and  Hq  may  be  regarded  as  equivalent. 

Now  let  us  consider  (1)  and  (2)  allowing  for  the  possibility  that  the 
disturbances  uu(i  =  0, 1)  follow  stationary  autoregressive  processes  of  order 


pt(i  =  0, 1),  namely  AR(p,),  as  follows: 

Pi 
uit  =  22  PijUit-j  +eiu         i  =  0, 1 
i=i 

where  <=p+l,p  +  2,  ...,ra  and  p  =  max(po,Pi)-    In  this  case,  a  locally 
equivalent  form  of  (1)  may  be  written  as 

Po 

Hq  :yt=  x'tP  +  2J PojUot-j  +  tot, 
i=i 

where  uot  —  Vt  —  x'tfl,  as  before. 

Although  several  procedures  are  available  in  the  literature  [for  a  recent 
review,  see  McAleer  and  Pesaran  (1986)],  a  convenient  test  of  the  null  model 
against  both  the  non-nested  alternative  H \  and  AR(po )  disturbances  can  be 
performed  by  testing  a  =  poi  =  />02  =  •  •  •  =  Pop0  —  0  in  the  auxiliary  linear 
regression  required  for  the  J  test  of  Davidson  and  MacKinnon  (1981),  namely 

Po 

yt  =  x'tj3  +  ^T  pojUot-j  +  ay u  +  e«, 
i=\ 

where  y\t  is  the  predicted  value  of  yt  from  (2),  namely 

p\ 
y\t  =  z'tJ  +  ^Pifiit-j, 

"it  =  yt  —  zfi  and  7  is  the  maximum  likelihood  estimate  of  7  under  H\. 

An  attractive  feature  of  this  approach  is  that  other  departures  from  the 
classical  conditions  may  be  handled  in  an  equally  straightforward  manner. 
Consider  the  following  general  form  of  the  distribution  of  the  disturbance 
term  for  H0  of  (1),  where  uot  follows  an  autoregressive  process  of  order  po? 
namely 

PO 

uot  =  22  Pojuot-j  +  cot, 
8 


and  €ot  is  independently  distributed.  The  density  of  eot,  denoted  by  fif(eot), 
is  assumed  to  be  a  member  of  the  symmetric  Pearson  family  of  distributions. 
This  is  not  a  very  restrictive  assumption  since  this  family  encompasses  many 
distributions  such  as  the  normal,  Student  t  and  F.  The  density  of  eot  is  then 
given  by 


g(eot)  =  exp[*(eot)] 
where 


f°° 

J  —  oo 


T  -1 

,      — oo  <  eot  <  oo 


^(eot)  =   /  [-<W(cot  +  ciejjj]  deot. 


When  c\  =  0,  g(eot)  reduces  to  a  normal  density  with  mean  zero  and  variance 
Cot-  Heteroskedasticity  is  introduced  through  cot-  It  is  assumed  that 

where  the  elements  of  the  ^  x  1  vector  vt  =  (un,U2t>  •  •  • ,  vq0t)'  are  fixed 
and  measured  around  their  means,  h(-)  is  a  twice  differentiate  function 
with  h(0)  =  ctq,  and  <f>i(l  =  1,2,  ...,^o)  are  unknown  parameters.  Under 
these  circumstances,  the  disturbances  for  Hq  in  (1)  are  now  non-normal, 
heteroskedastic  and  serially  dependent.  A  simple  local  approximation  to  this 
complicated  model  may  be  written  as 

po  qo 

Ho*  ■  Vt  =  x't0  +  ^pojUot-j  +  uot  ^2  favit  +clrt+  eot,  (5) 

>=i  1=1 

in  which 


rt  =  («oi  -3u0«cF5)/(455), 

n 
~2  -1  V^  ~2 


t=l 


with  eot  ~  AfID(0,  <7§)  for  all  t. 


To  verify  whether  model  (5)  is  a  LEA  model,  we  first  note  that,  when 

p0    =    (p01,P02,-'-,POp0)'    =  0?   <t>  =   (01,^2,-  ••,0<7o)'    =   0   and   Ci    =   C2    =    0, 

Hq  in  (1)  and  ifg*  in  (5)  are  identical  and  eo<  =  uot,  so  that  condition  (i)  is 
satisfied.  Godfrey  and  Wickens  (1982)  verified  condition  (ii)  above  for  the 
case  of  serial  correlation  and  heteroskedasticity.  It  is,  therefore,  necessary 
to  consider  only  the  non-normal  component.  First,  it  can  be  shown  that 
there  is  no  contribution  from  the  Jacobian  term.  The  Jacobian  from  Cirt  is 
asymptotically  given  by  [see  Godfrey  and  Wickens  (1982,  p. 85)] 


x> 


S(u20t-a20y 
-^ — 


4<7< 


which,  under  local  alternatives,  reduces  to 

3ci 


4<7' 


^(«o*-^o)» 


which  is  Op(l).  Therefore,  for  the  purpose  of  developing  a  joint  test,  we  can 
ignore  the  Jacobian  term. 

It  can  also  be  shown  that  the  score  with  respect  to  C\  is  the  same  under 
H0  and  Hq*.  From  Bera  and  Jarque  (1982,  p. 78),  it  follows  that 

dlt(p,ZlO)  =  u$t       3 
dc  i  4<7q       4 

where  lt((3,(7Q,~Y)  is  the  log-density  function  under  H0,  with  7  =  (poi,Po2,  ■  •  ■  5 
Popo-,  <t>\i<t>2-,  •  •  •  7<^o>ci)'-  Using  the  information  that  eot  ~  7V/.D(0,  <7q)  for 
all  t,  it  follows  from  (5)  that 

dir(P,Z2o,0)         ~    ,~2 

H =  rtu0t    °Q, 

where  /**(•)  is  the  log-density  function  under  Hq*.  Since 

k    **       k     ^     ' 
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the  score  under  the  original  and  LEA  models  is  the  same.  One  component  of 
the  LM  test  for  normality  is  based  on  this  score  value  [see  Bera  and  Jarque 
(1981),  and  Jarque  and  Bera  (1987)]. 

If  it  were  desired  to  test  serial  independence,  homoskedasticity  and  nor- 
mality under  Ho,  we  would  test  the  parameter  restrictions 

Po  =  0,  <f>  =  0,  ci  =  0 

in  equation  (5).  This  joint  test  procedure  has  been  suggested  by  Bera  and 
Jarque  (1982).  However,  they  did  not  consider  the  possibility  of  the  non- 
nested alternative  H\  together  with  the  non-sphericality  and  non-normality 
of  the  disturbance  term  under  Hq . 

If  suitable  predictions  y\t  from  a  non-nested  alternative  H\  are  aug- 
mented to  equation  (5)  to  yield  the  auxiliary  regression  given  by 


Po  go 

Vt  =  x'tP  +  ^2  Pofiot-j  +  uot  ]P  <f>ivu  +  cirt  +  ayu  +  eot,  (6) 

3=1  1=1 


then  the  null  hypothesis,  namely  equation  (1)  with  uq<  =  eo<?  involves  a  joint 
test  of 

H  :  po  =0,^  =  0,ci  =0,  a  =  0.  (7) 

This  joint  hypothesis  can  be  tested  by  applying  the  LM  procedure,  for  ex- 
ample, directly  to  equation  (6)  or  by  using  an  appropriately  adjusted  F  test 
[see  Godfrey  and  Wickens  (1982)].  In  computing  the  LM  test,  the  most 
convenient  form  is  nR2 ,  that  is,  the  sample  size  times  the  (uncentered)  coef- 
ficient of  determination  in  the  auxiliary  regression  of  a  vector  of  ones  on  the 
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following  variables: 
dl 


dp 


=  xtuot/a0 


'  =(Z2t-Z2)/(2Z*) 


da2 

=  U0tU0t-j/(TQ,  j  =  V2,  ...,po 


a/? 


a!-  =  (uotvit/0o)  -  Vlt,  Z=l,2,...,tfo 

^-  =  (r^0t/a02)  -  3(u2t  -  a2)/(4a04) 


and 


ft/** 

"a—  =  yuUot/crQ. 
The  set  of  regressors  given  above  can  be  simplified  to 
(x'tuot,uot  -  (70,u0tU(H_i, . . .  ,uotuot-PQ,(uot  -  a0)vu, . .  . , (uot  -  aQ)vqoU 

Although  this  nR2  version  of  LM  test  is  very  convenient  to  use,  it  often 
leads  to  statistics  that  behave  badly  in  finite  samples  [see  Davidson  and 
MacKinnon  (1983)  and  Bera  and  McKenzie  (1986)]. 

As  noted  by  Godfrey  and  Wickens  (1982, p. 86),  it  is  not  valid  to  apply 
the  standard  form  of  the  F  statistic  to  (6)  in  testing  H  in  (7).  For  example, 
Godfrey  and  Wickens  have  shown  that,  for  testing  (f>  =  0,  the  usual  regres- 
sion formula  omits  the  factor  2  arising  from  the  asymptotic  distribution  of 

n1'2  y^uot(uotvit).    Consider  now  the  F  statistic  for  testing  C\  =  0.    The 

t 
asymptotic  variance  of  n-1/2  Jy    uotrt  is  the  same  as  that  of 

t 

n~1/2  X>3,  -  3uS(a02)/(4a02). 


12 


Under  H,  uot  =  eot  ~  NID{0,  a2,)  for  all  t.  Therefore,  the  above  variance 
can  also  be  expressed  as 

[E{ul)  -  {E{u4t)f  +  9v4(E(u4t)  -  E(u0t)2) 

-  6a2(E(u60t)  -  E(u40t)E(u20t))]/(l64) 
=  [(105<708  -  9<t08)  +  9<704(3<704  -  a4)  -  6<r02(15<706  -  3<706)]/(16<704) 
=  21(74/8. 

However,  the  limiting  value  of  the  regression  formula  is 

(Toplim  n_1  22(uot  ~  3u0t<To)2/(16crJ)  =  3<r4/8, 
t 

which  is  one-seventh  of  the  correct  asymptotic  variance. 
Denote  the  standard  F  statistics  for  testing 

(po  =  0,  a  =  0),  (f>  =  0     and     c\  =  0 

by  F\ ,  i*2  and  Fz ,  respectively.  Note  that  the  regressors  corresponding  to  the 
above  parameters  are  asymptotically  uncorrelated  with  each  other,  and  also 
the  regressors  with  coefficients  <f>  and  c\  are  asymptotically  orthogonal  to  the 
regressors  of  the  null  model.  Therefore,  the  conditions  for  the  decomposition 
of  the  joint  test  are  satisfied  [see  Godfrey  (  1988,  p. 79)].  It  follows  that, 
under  Ho, 

(po  +  l)*i  +  2*0*2  +  jF3  -^  X2(Po  +  qo+  2). 

This  test  statistic  will  test  the  standard  linear  model  (1)  with  normal  and 
spherical  disturbances  against  a  broader  alternative  of  non-spherical  and  non- 
normal  disturbances,  as  well  as  against  a  non-nested  alternative.  Depending 
on  the  situation,  it  is  possible  to  specialize  the  test  statistic  to  particular 
alternatives  by  retaining  the  appropriate  regressors  in  equation  (6).  For 
example,  to  test  the  null  model  H0  against  a  non-nested  alternative  Hi  and 
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the  possible  presence  of  heteroskedasticity,  it  would  be  necessary  to  test  <f>  =  0 
and  a  =  0  in  the  auxiliary  regression  given  by 

yt  =  x'tfi  +  uot  22  <t>ivlt  +  ayu  +  eot- 

This  auxiliary  regression  equation  is  simply  a  specialization  of  equation  (6) 
with  poj  =  0     (j  =  1,2, . . .  ,po)  and  c\  =  0. 

4.  Monte  Carlo  Results 

As  mentioned  in  Section  2,  extensive  Monte  Carlo  results  have  been  re- 
ported in  the  literature  concerning  the  violation  of  the  classical  conditions 
in  testing  both  nested  and  non-nested  hypotheses.  However,  in  the  econo- 
metrics literature  these  two  types  of  hypothesis  testing  have  been  studied 
separately.  Here  we  shall  concentrate  on  a  particular  departure  from  the 
classical  conditions,  i.e.,  serial  dependence  (I)  (first-order  serial  correlation), 
together  with  the  non-nested  alternative.  This  specialized  form  of  the  joint 
test  is  easily  obtained  by  setting  poj  =  0  (j  =  2,3, ...,po),  <t>i  =  0  (/  = 
1, 2, . . .  ,qo)  and  c\  =  0  in  equation  (6)  to  give  the  auxiliary  regression 


yt  =  x't0  +  /90wot-i  +  ayu  +  eot. 


The  joint  hypothesis  of  H  :  /?o  =  0,  a  =  0  can  be  tested  by  computing  the 
standard  F  statistic  for  H  and  then  noting  2F  — ►  x2(2)  under  the  null 
hypothesis. 

For  comparison  with  our  joint  test  we  also  consider  the  J  test  of  David- 
son and  MacKinnon  (1981),  which  involves  testing  a  =  0  in  the  auxiliary 
regression  given  by 

Vt  =  x'tfi  +  ayu  +  eot- 

The  J  test  can  be  carried  out  by  the  standard  t  test.  We,  however,  performed 
this  procedure  using  a  X2(l)  test. 
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The  data  were  generated  by  the  following  non-nested  linear  regression 
models  with  AR(1)  disturbances: 


4 

Ho  :yot  =  ^2  xtjP]  +  uot  (8) 


and 

4 

Hi  :  yu  =  }] ztjij  +  uu  (9) 

with 

uu  =  Piun-i  +eiu         i  =  0, 1 

where  we  set  xt\  =  1  and  generated  xt2  from  ^(10,  25),  xtz  from  the  uniform 
distribution  in  the  interval  [7.5,  12.5]  and  xt4  from  xio-  To  generate  the  ztJ 
values,  we  set  ztJ  =  xtj,  j  =  1,2,3  and  obtained  zt4  from  A/*(10,20).  These 
values  of  the  design  matrices,  X  and  Z,  were  kept  fixed  from  one  replication 
to  another.  Serially  correlated  disturbances  were  generated  from  the  AR(1) 
process  by  setting  p0  =  p1  =  .3  and  p0  =  Pi  —  -4.  e^'s  (i=0,l)  were  obtained 
from7V(0,25). 

In  every  experiment,  for  a  given  sample  size  n,  we  calculated  the  above 
X2(l)  and  x2(2)  statistics  under  the  two  non-nested  models,  H0  and  Hi, 
both  with  AR(1)  disturbances.  The  experiments  were  performed  for  sample 
sizes  n  =  40,50,  70  and  100,  and  for  each  n,  we  carried  out  500  replications. 
The  results  are  presented  in  Tables  2  and  3  using  the  nominal  level  of  .05. 
The  reported  numbers  in  the  tables  are  proportion  of  rejections  of  the  model 
yt  =  x't(3  +  ut  which  has  no  serial  correlation  when  the  true  model  is  either 
H0  or  Hi  as  given  in  equations  (8)  and  (9). 
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TABLE  2.  Proportion  of  rejections  when  po  =  P\  =  -3 


DGP 

Ho 

#0 

#i 

#i 

x2(i) 

X2(2) 

x2(D 

X2(2) 

n  =  40 

.048 

.288 

.632 

.688 

n  =  50 

.060 

.370 

.780 

.820 

n  =  70 

.098 

.544 

.828 

.918 

n  =  100 

.222 

.738 

.948 

.982 

TABLE  3.  Proportion  of  rejections  when  p0  =  p1  =  A 


DGP 

Ho 

H0 

#i 

#i 

x2(D 

X2(2) 

x2(i) 

X2(2) 

n  =  40 

.044 

.494 

.700 

.844 

n  =  50 

.074 

.612 

.658 

.904 

n  =  70 

.112 

.782 

.648 

.942 

n  =  100 

.200 

.932 

.892 

.998 

When  the  DGP  is  given  by  the  null  model  H0  with  serially  correlated 
disturbances,  the  estimated  proportions  for  the  x2(l)  ^es^  require  careful 
interpretation.  The  situation  is  analogous  to  the  completely  misspecified 
cases  discussed  in  Section  2  in  that  the  x  (1)  test  is  designed  to  detect  non- 
nested component  in  the  design  matrix  when  the  design  matrix  is  correctly 
specified  but  serial  correlation  (I)  is  present.  If  we  view  the  x  (1)  test  as  a 
non-nested  test,  then  we  find  that  the  sizes  of  the  test  are  too  high  especially 
for  sample  sizes  n  =  70  and  100.  If  we  regard  this  as  a  significance  test,  then 
the  test  is  not  powerful  enough.  Our  X2(2)  is  a  two  directional  test  in  the 
directions  of  non-nested  alternatives  and  serial  correlation.  When  the  DGP 
is  Ho,  X2(2)  does  pick  up  serial  correlation  with  reasonable  power.  However, 
there  is  some  loss  of  power  due  to  'overtesting',  since  the  X2(2)  joint  test 
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seeks  to  detect  both  alternative  mean  specification  and  I,  when  only  I  is 
present. 

Turning  to  the  cases  where  the  data  are  generated  under  the  non-nested 
alternative  H\  as  in  equation  (9),  it  is  again  observed  that  serial  dependence 
(I)  has  adverse  effect  on  the  power  of  the  X2(l)  test.  As  clearly  seen  from 
Table  2  the  x2(l)  test  1S  less  powerful,  for  all  n,  than  the  x2(2)  test  under 
the  same  DGP.  This  is  the  problem  of  'undertesting'  which  is  common  to 
one-directional  tests  as,  in  our  cases,  the  X2(l)  test  can  only  detect  the  non- 
nested part  but  not  I.  Finally,  we  observe  that  the  joint  x2(2)  test  has 
very  good  power  under  this  setup  because  this  is  the  situation  for  which  the 
statistic  is  designed.  And  in  this  situation  we  observe  substantial  power  loss 
if  the  x2(l)  test  is  used  in  place  of  our  x2(2)  test. 

The  results  in  Table  3  are  for  p0  =  p1  =  A.  Under  the  DGP  Ho,  the 
X2(l)  test  has  similar  rejection  proportions  as  in  Table  2.  However,  under 
DGP  Hi ,  it  has  overall  lower  power  compared  to  p0  =  pi  =  .3  case.  This  is 
due  to  the  fact  that  the  X2(l)  test  does  not  take  account  of  serial  correlation 
at  all  and  po  =  pi  =  A  has  more  adverse  effect  than  p0  =  p1  =  .3.  The 
attractiveness  of  x2(2)  test  is  also  clearly  revealed  from  Table  3  results. 
Under  DGP  Hq,  the  x2(2)  test  detects  serial  correlation  50  percent  of  the 
time  even  for  sample  size  40.  And  when  the  departure  is  two-directional,  i.e., 
under  DGP  Hi,  the  powers  of  the  x2(2)  test  are  very  close  to  one  even  for 
small  sample  sizes.  This  limited  Monte  Carlo  study  could  easily  be  extended 
in  various  directions.  However,  what  we  presented  is  very  much  indicative 
of  the  general  finite  sample  behaviour  of  our  proposed  joint  test. 

5.  Conclusion 

In  this  paper  we  have  presented  some  simple  joint  tests  of  non-nested 
models  and  general  error  specifications.  The  joint  tests  for  non-nested  spec- 
ifications and  for  one  or  more  departures  from  the  classical  conditions  of 
serial  independence,  homoskedasticity  and  normality  were  developed  within 
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the  context  of  locally  equivalent  alternatives.  These  tests  represent  a  simple 
asymptotic  solution  to  the  "pre-testing"  problem  as  applied  to  non-nested 
linear  regression  models.  Results  of  our  limited  Monte  Carlo  study  demon- 
strate that  the  proposed  tests  have  good  finite  sample  properties.  If  the  null 
hypothesis  is  not  rejected  by  the  joint  tests,  standard  regression  analysis 
would  follow  for  the  underlying  null  model.  However,  if  the  null  is  rejected, 
it  is  not  possible  to  infer  whether  it  is  rejected  because  of  the  non-nested 
alternative  or  through  departures  from  the  classical  conditions  regarding  the 
disturbances. 
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